A Hybrid PSO/ACO Algorithm for Discovering Classification Rules in Data Mining
نویسندگان
چکیده
We have previously proposed a hybrid particle swarm optimisation/ant colony optimisation (PSO/ACO) algorithm for the discovery of classification rules. Unlike a conventional PSO algorithm, this hybrid algorithm can directly cope with nominal attributes, without converting nominal values into binary numbers in a preprocessing phase. PSO/ACO2 also directly deals with both continuous and nominal attribute values, a feature that current PSO and ACO rule induction algorithms lack. We evaluate the new version of the PSO/ACO algorithm (PSO/ACO2) in 27 public-domain, real-world data sets often used to benchmark the performance of classification algorithms. We compare the PSO/ACO2 algorithm to an industry standard algorithm PART and compare a reduced version of our PSO/ACO2 algorithm, coping only with continuous data, to our new classification algorithm for continuous data based on differential evolution. The results show that PSO/ACO2 is very competitive in terms of accuracy to PART and that PSO/ACO2 produces significantly simpler (smaller) rule sets, a desirable result in data mining—where the goal is to discover knowledge that is not only accurate but also comprehensible to the user. The results also show that the reduced PSO version for continuous attributes provides a slight increase in accuracy when compared to the differential evolution variant.
منابع مشابه
Hierarchical Classification of G-protein-coupled Receptors with a Pso/aco Algorithm
In our previous work we have proposed a hybrid Particle Swarm Optimisation / Ant Colony Optimisation (PSO/ACO) algorithm for discovering classification rules. In this paper we propose some modifications to the algorithm and apply it to a challenging hierarchical classification problem. This is a bioinformatics problem involving the prediction of G-ProteinCoupled Receptor’s (GPCR) hierarchical f...
متن کاملA Hybrid DEA Based CHAID and Imperialist Competitive Algorithm for Stock Selection
In this paper, the investment portfolio is formed based on the data mining algorithm of CHAID on the basis of the risk status criteria. In the next step, the second investment portfolio is created based on the decision rules extracted by the DEA-BCC model. The final portfolio is created through a two-objective mathematical programming model based on the Imperialist Competitive algorithm.
متن کاملAssociation Rule Generation by Hybrid Algorithm based on Particle Swarm Optimization and Genetic Algorithm
In data mining, association rule learning is a popular and well researched method for discovering interesting relations between variables in large databases. It analyzes and present strong rules discovered in databases using different measures of interestingness. The process of discovering interesting and unexpected rules from large data sets is known as association rule mining. This refers to ...
متن کاملOptimization Technique of Association with ACO for High Resolution Image Classification: Survey
Data mining is a process of discovering patterns and relationships in data with the help of various data analysis tools, to make valid predictions. Association rule learning which finds the relationships between the variables. Association rules are important features for image classification, mining and rational selection to obtain accurate classification. In this paper, it is an approach to pr...
متن کاملHybrid ANFIS with ant colony optimization algorithm for prediction of shear wave velocity from a carbonate reservoir in Iran
Shear wave velocity (Vs) data are key information for petrophysical, geophysical and geomechanical studies. Although compressional wave velocity (Vp) measurements exist in almost all wells, shear wave velocity is not recorded for most of elderly wells due to lack of technologic tools. Furthermore, measurement of shear wave velocity is to some extent costly. This study proposes a novel methodolo...
متن کامل